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Abstract 

A recently proposed method of constructing seismic networks from "record breaking events" 
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fSJ I from the earthquake catalog of California (Phy. Rev. E, 77 6,066104, 2008) was successfull in 

■ establishing causal features to seismicity and arrive at estimates for rupture length and its scaling 
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with magnitude. The results of our implementation of this procedure on the earthquake catalog 
of Japan establishes the robustness of the procedure. Additionally, we find that the temporal 



• distributions are able to detect heterogeneties in the seismicity of the region. 
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I. INTRODUCTION 



Earthquakes are a prime example of a natural phenomenon- where the spatial and tem- 
poral coordinates of an event can be measured with a fair degree of accuracy even as the 
dynamics of the process is not fully understood. The unpredictability of an earthquake 
occurence has not been reduced by more numerous or more accurate measurements. The 
difficulties of seismic analysis involves understanding the roles of the static/geometrical het- 
erogeneities in fault zones, the dynamic heterogeneities that result as stress builds up in the 
moving lithospheric plates and the possible interactions between the two The multidi- 
mensional character of seismicity makes it difficult to model it and this is further hampered 
by the restrictions of having to apply our understanding of rock friction from lab based 
results to the enormous length and time scales associated with earthquakes. In order to 
develop a realistic model for seismicity and for assessing the seismic potential of a region, 
the factors mentioned above needs to be estimated correctly. This will involve being able 
to measure the stress and strain at all points along the active fault - which is currently 
impossible ]^ . 

The observed clustering of earthquakes suggests that events are correlated in space-time. 
Thus, studies of earthquake correlations are important for understanding the dynamics of 
the process and for evolving prediction algorithms. In recent times, a large body of work in 
seismic studies involves relating the topology of complex networks constructed from earth- 
quake catalogs of a region to the spatio-temporal patterns exhibited by seismicity there. 
Such an approach is based on the idea that the patterns of seismicity may shed light on 
the fundamental physics, since these patterns are the emergent processes of the underlying 
many body nonlinear system j^. New patterns in the clustering of seismicity in space and 
time, new parameters that characterise the seismicity in a region, causal connections be- 
tween subsequent events, novel methods to estimate the rupture length and its scaling with 
magnitude - are some of the prominent outcomes of these catalog based studies ( see for eg. 
3l-ll0|). The central idea contained in these studies is to treat all events in the catalog on 
the same footing. That is, the space - time window based distinctions of events as foreshocks 
- mainshocks - aftershocks is avoided. Causal connections are extended to beyond immedi- 
ately subsequent events and also an event can have more than one correlated predecessor. In 
these studies, correlations between events have been established from different perspectives 
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and complex networks are constructed by linking correlated events. For example, the region 
studied may be divided into small cubic cells. Each cell becomes a node in the network if an 
event happens within it. The network is linked by directed edges between nodes representing 
successive events in the catalog js, 7, 8|. Alternately, correlations have been quantified with 
a metric which incorporates the fractal dimension df and the b value of the catalog used. 
Links are made between highly correlated events with directed edges 0, s], ^. 

A third procedure involves identifying record breaking recurrences of an event and linking 
each event in the catalog to its recurrences with directed edges [2], 1^. This procedure, 
illustrated on the earthquake catalog of Southern California had evolved a general method 
for inferring causal connections. The patterns exhibited by a network constructed from the 
actual catalog of the region was compared with the network of a randomly shuffled catalog 
(acausal) in order to establish causal features. In addition, this approach provides a method 
of estimating the rupture length, its scaling with magnitude and in identifying new scaling 
laws associated with the seismicity of the region. 

nn 

In this paper, we apply the procedure developed by the authors [2|, 110| on the earthquake 
catalog of Japan, in order to study the patterns exhibited by its network and to examine the 
robustness of the procedure for different seismic regions. This paper is organised as follows:- 
Section II briefly describes the construction of the network. Section III will describe the 
catalogs used for the analysis. Section IV contains the results and a discussion of it and in 
Section V we conclude. 



II. CONSTRUCTION OF THE NETWORK 



The procedure described in [10] is followed here to identify the recurrences and to con- 
struct the network. A directed network is constructed by linking each event i, (i = 1,2,3,- ■ • 
,N) in a catalog of events with a magnitude threshold of mth, with an edge pointing from 
it ( main event) to its recurrence. Each recurrence j, ( j = i+1,- ■ ■ ,N) is a record breaking 
event which is closer to the main event i than any event k in the intervening period tj — ti 
where tj > > ti. This implies that the {i + 1)*'* event is the flrst recurrence of the i^^ event 
for i = 1,2, ■ ■ ■ ,N — 1 and is always a recurrence however distant it might be. In this way, 
each event gets to have its own list of recurrences which follow it in time. This deflnition of 
a recurrence is based solely on the spatiotemporal relations between events in the catalog 
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and has no arbitrariness in its selection. Also, any event identified as a recurrence of a main 
event does not loose its status if the period of the catalog is enlarged later, as this will 
only result in the list of recurrences getting longer if the added events satisfy the condition 
of coming closer to the main event in the intervening period. The network is constructed 
by considering every event in the catalog as a node and each recurrence is represented by 
a directed edge between the pair of events, directed according to the time ordering of the 
earthquakes. The two variables associated with each edge are - kj, the distance between 
the epicentres of the two events and tij, the time interval between the them. Each node has 
an out-degree equal to its number of outward edges (the number of its recurrences) and an 
in-degree equal to the number of events it is counted to be a recurrence of. The first event in 
the catalog will not have any incoming edges to it and the last event in the catalog will have 
no outgoing edges from it. Note that the recurrences defined here do not necessarily relate 
to the conventional methods of defining aftershocks and thus the largest magnitude event 
in the network need not have the most number of recurrences. The overall structure of the 
network represented by the outgoing links and the incoming links describes the clustering 
of seismicity in the region studied. 



III. THE CATALOG 



The catalog studied here is the Japan University Network Earthquake Catalog Uj cover- 
ing the rectangular region (126.433°E-148.0°E) longitude and (25.730°N-47.831°N) latitude, 
for the period between January, 1986-December, 1998. The catalog has a threshold magni- 
tude of mjMA =2.0. However, we have hmited our study to events with magnitude > 2.5. 
The total number of such events are 114804. In order to verify the dependance of the net- 
work properties on magnitude threshold and on the number of events, subsets with higher 
magnitude thresholds viz 3.0, 3.5, 4.0 and 4.5 which contain 53525, 20067, 6268 and 2038 
events respectively and a subcatalog with irtth = 3.0 for a shorter period from 1986 -1992 
with 22975 events, were generated from the main catalog. The h value = —0.87 was obtained 
from a plot of log N{m) vs m, where N{m) is the cumulative number of earthquakes of mag- 
nitudes > m, and the linear fit was good in the range 2.5 < m < 7. In order to distinguish 
the causal features of seismicity, networks were constructed from the shuffled catalog, where 
we have adopted an identical shuffling procedure as in j^]. Subsets and sub catalogs of 
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the shuffled catalog are also made for the same rrith values as for the actual catalog. We 
propose to contrast the features of their in/out-degree, recurrent length and recurrent time 
distributions in order to relate the features of the seismic network of Japan with that of 
Cahfornia 0. 



FIG. 1: A spatial plot of the earthquake catalog ll|] with mth =2.5 for the period 1986 -1998. The 
catalog contains 114804 events. 
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IV. RESULTS 

A. Degree distributions 

The in-degree and out-degree of every node in each of the networks of different rrith 
is estimated and the histograms in Fig. [2](a) show the degree distributions for the actual 
catalog. The points and the lines of the same colour represent the out-degree and the in- 
degree distributions respectively, for each rrith value. As observed for California 10|], the 
in-degree histograms are all Poisson in nature while the out-degrees show an excess of nodes 
with lesser and higher degree when compared to a Poisson distribution of the same mean 
degree. The panel (6) in Fig. [2] represents the in/out-degree histograms for the shuffled 
catalog which is equivalent to random process. The in-degree for the shuffled catalog is 
observed to be a Poisson distribution. In a random network, the out-degree distributions 
are also expected to be Poisson. The shuffled catalog has an out-degree which though not a 
typical Poisson distribution, shows significant differences when compared to the out-degree 
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FIG. 2: The panels (a) and (b) in the figure represent the degree distributions for the actual and 
the shuffled catalogs respectively. The points represent the out-degree and lines of the same colour 
as the points represent the corressponding in-degree distributions for the different rrith values shown 
in the legend of the graph. The in-degree, for all rufh values is almost a Poisson distribution for 
both the actual and the shuffled catalogs. The out-degree of the actual catalog shows significant 
deviations from Poisson behaviour for low and higher degree values. In case of the shuffled catalog, 
the marginal deviation from Poisson behaviour is only towards the higher degree values. 
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distributions of the actual network especially for the lower degree values. For each network, 
the mean in-degree is equal to the mean out-degree. As expected, this decreases with rrith 
because the number of nodes and therefore, the number of links decrease with rrith- For a 
Poisson distribution, it is expected that (k) ^ log A^, which is what we observe in Fig. [3] 
for the shuffled catalog. For the actual catalog however, the number of links are less than 
that of a random network and we get (k) ^ 0.91 log A^. The value 0.91 is higher than that 
reported for California in [lOj and this means that larger number of links are possible for 
the nodes in the Japanese seismic network. 



B. Distance distributions 



We use the great circle distance for computing the distance kj between two event lo- 



12|: if 



cations. This is calculated using the Haversine formula 
(latitude, longitude) values for the two event locations and, A0 = (pj 
then 



Aj) and {(f)j,Xj) are the 
)j and A A = Aj — Xj, 



RAa 
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FIG. 3: The plot shows the variation of the mean degree, {k) with the number of events N . The 
hnes represent the best fits from which the slope ie estimated. The shuffled catalog, which is 
equivalent to a random process, shows a slope = 1 while the actual catalog shows a lower value of 
^ 0.91. 
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where Aa, the spherical distance, or central angle, between the two event locations is 



Ao" = 2 arcsin \ / sin^ 



I + cos(0jj cos(0jj sm 



AA^ 



i?, the radius of the earth, is taken as 6367 km. The PDF of the recurrent distances for all 
the i — j recurrent pairs in the actual network is shown in a log — log plot of Pm{^) versus 
/ in Fig. m^a). The different coloured points are for networks with different mj/^ values. All 
the plots show that the PDF is unimodal and exhibits a peak at a typical distance l^. The 
position of the peak is invariant with respect to the length of the catalog as can be seen 
from the peak positions of the green and black points, which corresspond to different catalog 
sizes having the same value of 3.0. Though we have not shown it in the figure, we have 
verified this for the other irith values also. The peak position however, depends upon nith 
and increases with it. Beyond the peak the distribution shows a power law decrease with an 
exponent 1.2 upto a cut-off which represents the size of the region studied. Since the least 
count of the catalog for latitude and longitude values is 0.001°, which translates to a least 
measurable distance ~ 0.1km, we are unable to resolve the points at distances < lOOmts. 
This affects the unnambiguous recognition of the peak for the lower mth plots. The position 
of the peaks for each plot (Zj^) was estimated by fitting a quadratic near the region of the 
peak. The variation of with m was studied to arrive at the rescaling parameter of 0.41 
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FIG. 4: The figure shows the distribution of the distances of recurrences for the different magnitude 
thresholds. The panel marked (a) is for the actual catalog and the one marked (b) for the shuffled 
catalog. The black points in both the plots represent a smaller catalog(1986 -1992) with rrith = 
3.0. For the actual catalog, the size of the catalog does not affect the position of the peak as seen 
in the peak positions of the green and black points. However, for the shuffled catalog, the distance 
distribution varies with size of the catalog. The inset shows the data collapse achieved by rescaling 
the distances and the distributions according to eqn. [TJ The red line in the inset represents the 
pre- factor Lq in the scaling law for the rupture length given by l*{m) = Lq x lo'^-^^"^*'' , where Lg 
= 0.066 km 




for the inset data collapse in Fig. Hl^a). The data collapse obeys the relation 

P"(/) ^ /-i-2f(//10°-^^'"*^) (1) 



The scaling function F has two regimes- a power law increase with an exponent 1.8 for 
small arguments and a constant regime at larger arguments. The characteristic distance of 
the peaks can be written as 

/*(m) =LoX 10°-^^™*'^ (2) 

where Lq = 0.066. The distance distributions for the shuffled catalog in Fig. HJ^b) shows the 
same overall shape as for the actual catalog. We also find that the peaks shift to higher 
values of / as rrith increases. However, one important variation observed here was that the 
position of the peak is not independant of A^. This striking and important feature was 
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reported for the California catalog too [10|. The invariance of for the actual catalog 
was attributed to the causal structure of seismicity and is therefore expected to reflect the 
features of the underlying dynamics [lo| . 

1. Identifcation with rupture length 

An useful outcome of constructing such a network of recurrent events is, being able to 
arrive at an estimate for the rupture length in the region, from its earthquake catalog. The 
values of are identified as the rupture lengths and in the case of the California region the 
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values obtained were reported to be in good agreement with the estimates given in [13|, 
The identification of as the rupture length is further established by the appearance of the 
same distances in the distribution of the distances of the first recurrences. The distribution 
of the first recurrences for California had the form 

P™(/) = r'^'-F(//10°-^5'""^) (3) 

with Sr = 0.6. The first recurrence of an event i is the distance between the i*'* and {i + 1)*'^ 
event. The distances of the first recurrence is estimated for all the events in the catalog and 
its distribution for the actual catalog is shown in figJSl We also observe the peaks for the 
different mth values appearing at the same distances as in fig. [Hand therefore the /I values 



estimated from fig. H] can be considered as the rupture lengths as indicated in 10|]. The 
distribution of first recurrences in our analysis can be written as 

P™(/) = /-^'■F(//10°-^^™*'') (4) 

with 6r = 0.6 and the scaling function F has the same form as F in fig. |H 

The rupture lengths for Japan estimated using eqn. [2] compares favourably with the values 

I I 

estimated using L^(m') ^ 0.02 x lO^-^""' llSi] and L^(m') ^ 0.018 x iQO-^e™' [ij] as seen 
from Table [T] 
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TABLE I: The values tabulated here show a comparison of the rupture lengths estimated by this 
procedure with other estimates in 13, llJ] • Note that the magnitude scale used in 13|, [ij] are 
moment magnitude (Mw)- We have used the relation = 0.58Mjma + 2.25 [is'] in order to 
convert the JMA magnitude values to its equivalent moment magnitude. 
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FIG. 5: The distribution of distances of the first recurrence for the different rrith values is shown 
here. The peaks at the characteristic distances that was observed in fig. U] is visible here too. The 
inset shows the datacollpase achieved by rescaling the distances and the distributions according to 
eqn. HI 
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C. Temporal distribution of recurrences 

The PDF of the time intervals tij, between events and their recurrences for different rrith 
values is shown in a log — log plot in fig. |6]^a) for the actual catalog and in the panel marked 
(b) for the shuffled catalog. As observed for the California region, for intermediate times the 
distributions show a power law decay with an exponent ^1.0 with an observational cut-off 
at the longest time scales. The distributions for the different rrith values are all collapsed 
showing no dependance on magnitude in this region. However, in the region of the shorter 
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FIG. 6: The distributions of the recurrent times for different values is shown here in panels 
(a) and (6) for the actual and shuffled catalogs respectively. Significant differences can be observed 
between the two distributions. However, the dependence of the distributions on m in the actual 
catalog is , a major departure from what is expected from a causal network IC]. 
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time scales, the different distributions show a dependance on m. The power law exponent 
in this region is much less than 1.0 and varies from —0.3 to —0.45 as rrith increases from 
2.5 to 4.5. The range over which this smaller exponent holds good depends on rrith and 
increases with it. This aspect of the recurrent time distributions is at variance with that 
observed for California, where the temporal distributions were found to be independent 
of m for the actual catalog and showed a single power law regime over almost the entire 
range. This behaviour was attributed to the causal features of seismicity. The recurrent 
time distributions for the shuffled catalog in fig. [6 b) shows a more prominent dependence 
on m, which is expected for a random network [lO|. That the actual catalog for Japan too 
shows characteristics attributed to an acausal network signifies a random behaviour in the 
temporal distributions in the case of Japan. 



V. CONCLUSIONS 



The general procedure that was introduced to infer causal structure from clusters of lo- 
calised events was illustrated on the earthquake catalog of California. Our results from 
adopting the procedure on the earthquake catalog of Japan confirms its robustness. The 
construction of the network involves establishing correlations between nodes without making 
any assumptions regarding the nature of the correlations. The distinctive features of the 
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seismic network when compared to an acausal network was attributed to the causal con- 
nections between events. Our resuhs show that features of random networks are universal 
with the shuffled catalogs of both the regions showing identical network properties. The 
network from the actual catalog on the other hand shows some region specific features along 
with exhibiting typical causal attributes. The estimates of rupture length which arise as 
natural outcomes of the network construction are in good agreement with values estimated 
by other methods. The anamolies that are shown by the recurrent time distributions in the 
case of Japan when compared to expectations for a causal network could be a result of the 
heterogenous nature of seismicity in the region. Japan experiences a high level of seismicity 
due to its position at the cusp of the Pacific-Philippine-Eurasian triple plate junction. It 
has a predominantly compressional tectonic regime as compared to Western United States 
which is dominantly extensional 16|. We expect this aspect, along with the fact that dif- 



ferent recurrence intervals exist in Japan for earthquakes in subduction zones as compared 
to areas along crustal faults, to influence the recurrent time distributions which is shown in 
our results here. However analysis of catalogs for other such regions would have to be done 
in order to conflrm this stand. 
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